Phoneme recognition using time-warping neural networks
نویسنده
چکیده
This paper proposes a novel neural network architecture for phoneme-based speech recognition. The new architecture is composed of five time-warping sub-networks and an output layer which integrates the sub-networks. Each time-warping sub-network has a different time-warping function embedded between the input layer and the first hidden layer. A time-warping sub-network recognizes the input speech warping the time axis using its time-warping function. The network is called the Time-Warping Neural Network (TWNN). The purpose of this network is to cope with the temporal variability of acoustic-phonetic features. The TWNN demonstrates a higher phoneme recognition accuracy than a baseline recognizer composed of time-delay neural networks with a linear time alignment mechanism.
منابع مشابه
Novel Objective Function for Improved Phoneme Recognition Using Time-delay Neural Networks. Vii. Conclusion and Future Work Iv. Phoneme and Viseme Coding
In this paper we show how recognition perfor-mance in automated speech perception can be significantlyimproved by additional Lipreading, so called “speech-read-ing”. We show this on an extension of an existing state-of-the-art speech recognition system, a modular MS-TDNN. Theacoustic and visual speech data is preclassified in two sepa-rate front-end phoneme TDNNs and com...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملIsolated Voiced Digit Recognition Using Inductive Inference
This paper proposes the use of inductive inference "decision trees" for isolated digit recognition. The aim of this research is to demonstrate that inductive learning can provide an alternative approach to existing automatic speech recognition techniques such as Dynamic Time Warping (DP), Hidden Markov Modelling (HMM) and Neural Networks (NN). The construction of the decision tree is based on C...
متن کاملContinuous Speech Phoneme Recognition Using Dynamic Artificial Neural Networks
Phoneme classification and recognition is the first step to large vocabulary continuous speech recognition. This step represents the acoustic modeling part of such a system. In hybrid speech recognition systems phoneme recognition is made by artificial neural networks (ANN’s). The main objective of this paper is the investigation of dynamic ANN’s, namely the Time-Delay Neural Networks (TDNN) an...
متن کاملWavelet Transform Speech Recognition Using Vector Quantization, Dynamic Time Warping and Artificial Neural Networks
In this paper we investigate the performance of the Discrete Wavelet Transform (DWT) with Dynamic Time Warping, Vector Quantization and Artificial Neural Networks for speaker-dependent, isolated word recognition. Wavelet Transforms have demonstrated good time-frequency localization properties and are appropriate tools for the analysis of non-stationary signals like speech. Moreover, unlike LPC,...
متن کامل